
Title: The Local Influence of Pioneer Investigators on Technology Adoption: Evidence from New Cancer Drugs
Authors: Leila Agha and David Molitor

DATA

A. MEDICARE CLAIMS DATA
The files in this directory can be combined with Medicare claims data to replicate the analysis in this paper. 
Under the terms of our Data Use Agreement, we cannot share the Medicare claims data that underlie this analysis.
Researchers may request access to Medicare claims data by contacting ResDAC:
https://www.resdac.org/cms-data/request/cms-data-request-center
This analysis relies on the 20% Carrier and 100% Outpatient Medicare claims from 1998-2008.

B. AUXILIARY DATA--PROVIDED IN DATA FOLDER
All other data sets not covered by our CMS DUA are provided in the data folder. 
We combined the Medicare claims with detailed information on cancer drug trials culled from public FDA releases and clinical trial publications. Table 1 in the paper summarizes the clinical trial information.
Medicare data is merged to PhysicianID.xlsx (provided here). PhysicianID.xlsx provides a list of authors, authorship position, zip code, NPI, UPIN, citations and publications. See the "notes" tab for more detailed variable definitions.
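
For illustration only, the kind of identifier-based linkage described above could be sketched in Python as follows. This is not the authors' code, which is in the SAS and Stata files listed under REPLICATION CODE. The field names (npi, upin, author), the sample records, and the rule of trying NPI first and then UPIN are all assumptions made for the example; see the "notes" tab of PhysicianID.xlsx for the actual variable definitions.

```python
# Hypothetical sketch: link claims to an author list by NPI, falling back to UPIN.
# All field names and records below are invented for illustration.

def build_index(physicians, key):
    """Map an identifier (NPI or UPIN) to its physician record, skipping blanks."""
    return {p[key]: p for p in physicians if p.get(key)}

def link_claim(claim, by_npi, by_upin):
    """Return the matching physician record, or None if neither identifier matches."""
    match = by_npi.get(claim.get("npi"))
    if match is None:
        match = by_upin.get(claim.get("upin"))
    return match

physicians = [
    {"author": "A. Example", "npi": "1000000001", "upin": "A00001"},
    {"author": "B. Example", "npi": "", "upin": "B00002"},
]
claims = [
    {"claim_id": 1, "npi": "1000000001", "upin": ""},
    {"claim_id": 2, "npi": "", "upin": "B00002"},
    {"claim_id": 3, "npi": "9999999999", "upin": ""},
]

by_npi = build_index(physicians, "npi")
by_upin = build_index(physicians, "upin")
linked = {c["claim_id"]: link_claim(c, by_npi, by_upin) for c in claims}
```

Blank identifiers are dropped from the indexes so that a claim with a missing NPI or UPIN cannot match a physician whose identifier is also missing.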

We also used a series of crosswalks to match ZIP codes to the Dartmouth Atlas HRR and HSA definitions, and to define neighboring HRRs and HSAs. These crosswalks can be found in the folder data/crosswalks.
ZIP code data were downloaded from http://federalgovernmentzipcodes.us/ on 4/5/2012, on the recommendation of the Missouri Census Data Center.
See FreeZipcodeDatabase.htm for more information on these data.
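
As a rough sketch of how such crosswalks are typically applied (not the authors' implementation), the Python below maps a ZIP code to an HRR and then looks up that HRR's neighbors. The column names (zip, hrr), the HRR numbers, and the neighbor table are made up for the example and do not reflect the contents of data/crosswalks.

```python
# Hypothetical sketch: attach HRR codes to ZIP codes and look up neighboring HRRs.
# The crosswalk contents and column names here are invented for illustration.
import csv
import io

ZIP_TO_HRR_CSV = """zip,hrr
02139,101
2115,101
10027,102
"""

# Assumed symmetric adjacency table: HRR -> set of neighboring HRRs.
NEIGHBORS = {101: {102}, 102: {101}}

def normalize_zip(zip_code):
    """Zero-pad a ZIP code to five characters (leading zeros are often lost)."""
    return str(zip_code).strip().zfill(5)

def load_zip_to_hrr(text):
    """Read a ZIP-to-HRR crosswalk from CSV text into a dictionary."""
    reader = csv.DictReader(io.StringIO(text))
    return {normalize_zip(row["zip"]): int(row["hrr"]) for row in reader}

def neighbors_of_zip(zip_code, zip_to_hrr, neighbors):
    """Return the set of HRRs adjacent to the HRR containing zip_code."""
    hrr = zip_to_hrr.get(normalize_zip(zip_code))
    if hrr is None:
        return set()
    return neighbors.get(hrr, set())

zip_to_hrr = load_zip_to_hrr(ZIP_TO_HRR_CSV)
```

ZIP codes are normalized to five-character strings on both sides of the lookup because codes stored as numbers lose their leading zeros, which would otherwise cause silent mismatches.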

REPLICATION CODE
The replication code proceeds in three steps. 
1.extract-restat.sas provides SAS code that calls the raw Medicare claims files and extracts the relevant claims. It also relies on files provided in data/auxiliary that list the file names for the raw Medicare claims files.
2.create-restat.do takes the extracted Medicare claims and creates our data set for analysis. It merges Medicare data to the drug trial information and geographic data described in Section B (Auxiliary Data) above.
3.analysis_figures_tables.do uses the final analysis data set to run regressions and create the tables and figures reported in the paper.



